Fluctuation of the download network 
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The scaling behavior of fluctuation for a download network which we have investigated a few years 
ago based upon Zhang's Encophysics web page has been presented. A power law scaling, namely 
a ~ (/)" exists between the dispersion a and average flux (/) of the download rates. The fluctuation 
exponent a is neither 1/2 nor 1 which was claimed as two universal fluctuation classes in previous 
publication, instead it varies from 1/2 to 1 with the time window in which the download data were 
accumulated. The crossover behavior of fluctuation exponents can be qualitatively understood by 
the external driving fluctuation model for a small-size system or a network traffic model which 
suggests congestion as the origin. 
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Many phenomenological and statistical analysis have 
been made for the complex networks P, [2] . In those re- 
searches, most studies focused on the long-time behavior 
of a certain complex network. In this sense, the feature of 
the network corresponds to its static characteristic. How- 
ever, time evolution of the network topology is also very 
important. During its evolution, the network nodes ex- 
perience different traffic flux time by time and the fluctu- 
ation is unavoidable. Actually, fluctuation is a universal 
phenomenon which exists in many different fields, such 
as nuclear fragmentation or hadron production [3, 13j : 
which can also be related to the critical behavior or self- 
organized criticality. For instance, the dispersion (cr) of 
an order parameter, such as the charge of the largest 
fragments in nuclear fragmentation, shows a transition 
from cr cx (/)^^^ (the ordered phase) to cr oc (/) (the dis- 
ordered phase) when the multifragmentation phase tran- 
sition takes place in hot nuclear system [sl, !4| (here (/) 
is the average of the order parameter). For network dy- 
namics, recently, Menezes and Barabasi investigated the 
fluctuation in a number of real world networks, which 
includes internet, river network, microchip, WWW and 
highway network dynamics and presented a model to un- 
derstand the origin of fluctuation in traffic process Q. 
They found that the fluctuation is dominantly driven by 
either internal or external dynamics of the complex sys- 
tem [6,]. In their studies, they found there is a power-law 
scaling for the dispersion and the average flux, namely 
a oc (/)", and there are two classes of universality for real 
systems. In the Internet and the computer chip there is 
robust internal dynamics which leads to the fluctuation 
exponent a = 1/2, while highway and Web traffic are 
driven by external demand which leads to the fluctua- 
tion exponent a = 1. Authors use a stylized model of 
random walkers throughout network, they thought what 
is probably one of the most important factors in the traf- 
fic dynamics on networks is the limited capacity of nodes 
to handle packets simultaneously, which leads to pack- 
pack interaction and induce large fluctuations or even 
network congestion. However, a recent study on scaling 
of fluctuation in internet traffic shows that the fluctua- 



tion is different from 1 /2 which was claimed in the above 
papers. They developed a model where the arrival and 
departure of "packets" follow exponential distribution, 
and the processing capability of nodes is either unlim- 
ited or finite was proposed by Duch and Arenas |7|| . This 
model presents a wide variety of exponents between 1/2 
and 1, revealing their dependence on the few parame- 
ters considered, and questioning the existence of univer- 
sality classes. Hence it seems that the universal classes 
of fluctuation scaling for network dynamics are far from 
reaching consensus and therefore it is worthy to further 
investigate what about the fluctuation behavior in other 
real networks. Neverthless, so far there are few analy- 
sis on the fluctuation behavior of other specific networks 
rather than the networks which have been investigated in 
Ref. [1, 0|- In this work, we will investigate the network 
evolution and fluctuation based on our previous study of 
the download network. 

In our previous work in 2004 Q, we reported, for the 
first time, that the download frequency of the papers in 
a web page is also a scale-free network. Its rank-ordered 
download distribution can be described by the Zipf law 
(ol flOl or Tsallis' non-extensive entropy [HI. The data 
set of the download rates comes from a well constructed 
web page in the field of economical physics (so-called 
Econophysics) by Zhang since 1998 121. Furthermore, 
the mechanism of network growth was explained by the 
preferential attachment network model of Barabasi and 
Albert. Since three years have passed after this net- 
work analysis, it is of interesting to see how this network 
evolves and how about the fluctuation of download rate. 

Firstly let us see some plots of rank distributions of 
the download numbers from the data on 2004/08/31 to 
2007/07/29 which is shown in Fig. [TJ Roughly, rank dis- 
tribution are almost linear in double logarithm plots and 
they can be described by the Zipf law @ 
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where the 7 is the Zipf law exponent. Zipf's law or scale 
free networks is different from the predictions of pur e 
random networks introduced by Erdos and Renyi [l3|. 
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FIG. 1: The rank-ordered (Zipf-typo) plot for the download num- 
bers of the papers in http:/ /www.unifr.ch/econophysics web page. 
The symbols are illustrated in figure. See text for details. In the 
insert of left bottom corner, it shows the evolution of Zipf exponent 
as a function of the days starting from 2004/08/31. 
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FIG. 2: Time evolution of download numbers which is sorted by 
the different ranks of the papers. The arrows in the upper axis 
illustrate three time points, namely 2005/03/13, 2005/12/26 and 
2007/03/26 from left to right) which will be used to investigate the 
time window effects afterwards. 



Roughly, the shapes of these distributions keep similar 
in all times. However, quantitative analysis shows non- 
constant behavior of the evolution of Zipf exponent (7) 
which is shown in the inset of the Figure 1. Especially 
there is a bump during 2006, i.e. the rank-ordered distri- 
butions tend to be steeper, which reflects higher down- 
load frequency for higher rank papers. However, the ex- 
ponent decreases in 2007, i.e. more flatter distribution, 
which is obviously seen in the Figure 1 (diamond points). 
In this case, the web visitors prefer to download more pa- 
pers listed in the web page which are not only focused on 
those top downloaded papers. 

To quantitatively see the increasing download num- 
bers with time, we make a plot in Fig. [5] for the rank- 
sorted download numbers AiV starting from the date 
2004/08/31 (i.e. AA^ = on 2004/08/31) as a function 
of days which passed starting from 2004/08/31. From 
the figure, all curves do not follow the exact linear in- 
creasing. In other words, there exists fluctuation for the 
download rates day by day. 

Quantitative fluctuation of the download rates can be 
described by the average download numbers per day, 
namely the average flux, (/) = j-^^j-, where ti is the 
time of the download day i (i.e., the abscissa of Fig. ^ 
and i from the starting date (2004/08/31) to the end- 
ing date which will be illustrated later. For each rank- 
ordered paper, these download rates change day by day, 
from which we can extract the average flux (/) and its 
dispersion a (root of mean square of the download rate 
distribution) for each paper. 

Fig. [3] shows the relationship between the average 
download rates (/) (left column) or the dispersion a 
(left column) as a function of the rank for the accumu- 
lated data during 2004/08/31 to 2005/03/13 (i.e. - 6.5 
months)(top row), from 2004/08/31 to 2005/12/26 (i.e. 
~ 31 months) (i.e. 16 months) (middle row) and from 
2004/08/31 to 2007/03/26 (i.e. - 31 months) (bottom 
row). In left columns, the download rates show a fast 



decay with the increasing of rank for those most down- 
loaded papers and the keep fluctuation for large rank 
values. It can be qualitatively understood that the web 
visitors prefer to download the top rank-ordered papers 
when he/she visits this page for the first time. This aver- 
age day-by-day download flux can be roughly described 
by the exponential decay flts: 
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which is plotted in the flgures and the half-lifetime decay 
exponent R is shown in the inset. R is small and seems to 
increase with the time period during which the data were 
accumulated. Right columns depict the dispersion as a 
function of the ranks which does not exhibit an obvious 
exponential decay as (/) versus rank shows, rather than 
frequent fluctuations. 

Fig. [4] demonstrates the relationship of (/} and a for 
all rank-ordered papers. To investigate the possible ef- 
fect of time window in which the download data were 
accumulated, we use the data ensembles which corre- 
spond to the period from 2004/08/31 to 2005/03/13 (a), 
from 2004/08/31 to 2005/12/26 (b) and from 2004/08/31 
to 2007/03/26 (c), respectively. From each double loga- 
rithm plot, all points basically show linear increases be- 
tween the average flux and its dispersion. In this context, 
we fit the data points using the power law: 



(3) 



to extract the scaling parameter a which are shown in 
the inset of each panel. There are two points which we 
can learn from the figure: (1) the scaling parameter a is 
neither 1/2 nor 1. In the work of Menezes and Barabasi, 
they thought there are two universal fluctuation classes: 
a ~ 1/2 or 1 systems. The typical example of the former 
is the Internet network, which was claimed to be domi- 
nantly driven by the internal dynamics. And the typical 
example of the latter is the WWW URL links, which was 
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FIG. 3: The average flux (/) (left column) or the dispersion a 
(left column) as a function of the rank for the accumulated data 
during 2004/08/31 to 2005/03/13 (top row), from 2004/08/31 to 
2005/12/26 (middle row) and from 2004/08/31 to 2007/03/26 (bot- 
tom row). The line in left panels represents the exponential fits 
using Eq[2] 
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FIG. 4: (T as a function of (/) for all rank-ordered papers in three 
accumulated time windows as illustrated in the figures. 



claimed to be dominantly driven by the external dynam- 
ics. The exponents of our download network are between 
1/2 and 1. (2) The scaling exponents seem to depend 
on the time windows during which the data samples are 
collected. In the other words, the longer the time win- 
dows, the larger the fluctuation exponent. Hence, in this 
viewpoint, we cannot exclude the possibility that fluctu- 
ation exponent could reach to 1 if we take very long time 
windows from the present work. 

Qualitatively, our network fluctuation could be ex- 
plained by the interplay of different fluctuation types, 
namely internal fluctuation and external fluctuation, 
which was proposed by Menezes and Barabasi In 
their model, they consider the random diffusion of W 
walkers (here they represent the visitors who download 
the papers) on the network, such that each walker that 
reaches a node i (here it represents the rank-ordered pa- 
per) departs in the next time step along one of other 
nodes. Originally each walker is placed on the network 
at a randomly chosen location and removed after it per- 
forms M steps, mimicking in a highly simplified fashion 
a human browser surfing the Web for information [6] . In 
this way, finally the relationship between the average flux 
and the fluctuations follow a fluctuation scaling with a — 
1/2, corresponding to internal fluctuation driven behav- 
ior. However, in real systems the fluctuation on a given 
node is determined not only by the system's internal dy- 



namics, but also by changes in the external condition. 
To incorporate externally induced fluctuation, Menezes 
and Barabasi allow W (the number of walkers in the 
web page), to vary from one day to the other. This is 
of course true, especially in case that peoples visit un- 
congested web page, such as the Encophysics web page. 
Assuming that the day to day variations of W{€) define 
a dynamic variable chosen from an uniform distribution 
in the interval \W-^W, W + AW], for AW = one 
recovers a = 1/2. However, when AW exceeds a cer- 
tain threshold, in both models the dynamical exponent 
changes to a = 1 [1]. In this case, the external fluctu- 
ation can overshadow the internal fluctuation so that a 
= 1. However, our network fluctuation behavior is not 
exactly the above extreme cases, instead it seems to be 
located in the transition region between the two extreme 
cases, namely the internal fluctuation type (a — 1/2) 
and external fluctuation type { a = 1). This could be 
explained by a smaller AW which does not exceed the 
certain threshold corresponding to a transition condition 
from a = 1/2 to 1 in our download network. We think 
this is reasonable since the Encophysics web page is not a 
popular web page, such as Yahoo or Sina web pages, in- 
stead it is a small-circle scientific web page and no many 
people browse it often. In this case, AW could be small 
due to a few people browse this web page day-by-day so 
that the day variations AW cannot exceed the certain 
threshold. Actually, there exists a wide transition region 
where a is between 1/2 to 1 for a finite network system 
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in Menezes and Barabasi's model. Therefore, our inter- 
pretation for the origin of the download network fluctua- 
tion is not contradicted to their model. Using the above 
scenario, we can qualitatively learn what the fluctuation 
originates from for our download network. 

However, the above scenario which assumes any ex- 
ternal driving force is not unique to interpret the ob- 
served crossover fluctuation exponents between 1/2 and 
1. A simple traffic model in complex networks that sug- 
gests congestion as the origin of the increase of a and 
captures the essential parameters governing the dynam- 
ical process 0] is also possible to explain the download 
network fluctuation. In that model, traffic process in a 
complex network of N nodes as N queue systems, and 
a random walk simulation for the movement of packets 
on the network. The arrival process of packets to the 
network is controlled by a Poisson distribution with pa- 
rameter A, each packet enters the network at a random 
selected node. Once the packet arrives to the node en- 
ters a queue. The delivery of the packets in the queue is 
controlled by an exponential distribution of service times 
with parameter fi. In that model, the packets will per- 
form S random steps in the network before disappearing. 
This dynamics is performed in continuous time, assuming 
that the time expended by packets traveling through a 
link is negligible. The model can flnally account for differ- 
ent scaling exponents a depending on the parameters A, 
/i, 5, and the time period P. Especially, we are interest- 
ing to see that a is a function of the time window length 
P in which the average was taken, which changes from 
1/2 to 1: a increases with the the time window length in 
the transition region. In the present study, even though 
our time window means the whole statistical one in which 
the download data were accumulated, which is different 
from the above mentioned time window in which the av- 



erage were taken in the above model, the effect could be 
analogous: the larger accumulated time windows can be 
somewhat equivalent to the larger time window length P 
in which the average were taken. The same trend which 
a increases with time was observed. Actually, the fluctu- 
ation exponents between 1/2 and 1 have been observed in 
the stock market transaction and other human dynamics 
such as emails from a particular company and data on 
the printing activity etc, and their exponent shows the 
dependences on the time window size. A detailed review 
can be found in Ref . . 

In summary, we investigated the evolution of the down- 
load network for the rank-ordered papers which were 
listed in Zhang's Encophysics web page. In recent three 
years, the download distribution shows the change of the 
exponents even though the rank-ordered distribution still 
keeps scale-free feature, reflecting the change of traffic on 
nodes which represent the given downloaded papers. Fur- 
ther, we give quantitative analysis for the average down- 
load rates (/) per day, which show day-by-day fluctua- 
tion. The average flux shows a fast exponential decay 
as a function of the rank, while the dispersion does not 
show an obvious dependence of the rank. Interestingly, 
the dispersion of the download rate distributions shows a 
power-law scaling behavior with its average flux, namely 
a oc (/)". In different time windows ranging from about 
6.5 months to 31 months in which the download distribu- 
tions are accumulated, the scaling parameter a changes 
with the time windows, namely from 0.60 to 0.89. The 
origins are qualitatively interpreted by two models. Fu- 
ture work on quantitative model simulation and a possi- 
ble A-scaling of network fluctuation is in progress. 
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